NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

نویسندگان

  • Deepak Kumar
  • M. N. Anil Prasad
  • A. G. Ramakrishnan
چکیده

In this paper, we report a breakthrough result on the difficult task of segmentation and recognition of coloured text from the word image dataset of ICDAR robust reading competition challenge 2: reading text in scene images. We split the word image into individual colour, gray and lightness planes and enhance the contrast of each of these planes independently by a power-law transform. The discrimination factor of each plane is computed as the maximum between-class variance used in Otsu thresholding. The plane that has maximum discrimination factor is selected for segmentation. The trial version of Omnipage OCR is then used on the binarized words for recognition. Our recognition results on ICDAR 2011 and ICDAR 2003 word datasets are compared with those reported in the literature. As baseline, the images binarized by simple global and local thresholding techniques were also recognized. The word recognition rate obtained by our non-linear enhancement and selection of plance method is 72.8% and 66.2% for ICDAR 2011 and 2003 word datasets, respectively. We have created ground-truth for each image at the pixel level to benchmark these datasets using a toolkit developed by us. The recognition rate of benchmarked images is 86.7% and 83.9% for ICDAR 2011 and 2003 datasets, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

NESP: Nonlinear enhancement and selection of plane for optimal segmentation and recognition of scene word images

Program Committee Gady Agam, Illinois Institute of Technology (United States); Elisa H. Barney Smith, Boise State Univ. (United States); William A. Barrett, Brigham Young Univ. (United States); Kathrin Berkner, Ricoh Innovations, Inc. (United States); Hervé Déjean, Xerox Research Ctr. Europe Grenoble (France); Xiaoqing Ding, Tsinghua Univ. (China); David Scott Doermann, Univ. of Maryland, Colle...

متن کامل

Automatic Prostate Cancer Segmentation Using Kinetic Analysis in Dynamic Contrast-Enhanced MRI

Background: Dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) provides functional information on the microcirculation in tissues by analyzing the enhancement kinetics which can be used as biomarkers for prostate lesions detection and characterization.Objective: The purpose of this study is to investigate spatiotemporal patterns of tumors by extracting semi-quantitative as well as w...

متن کامل

A Hybrid Algorithm based on Deep Learning and Restricted Boltzmann Machine for Car Semantic Segmentation from Unmanned Aerial Vehicles (UAVs)-based Thermal Infrared Images

Nowadays, ground vehicle monitoring (GVM) is one of the areas of application in the intelligent traffic control system using image processing methods. In this context, the use of unmanned aerial vehicles based on thermal infrared (UAV-TIR) images is one of the optimal options for GVM due to the suitable spatial resolution, cost-effective and low volume of images. The methods that have been prop...

متن کامل

A Novel Spot-Enhancement Anisotropic Diffusion Method for the Improvement of Segmentation in Two-dimensional Gel Electrophoresis Images, Based on the Watershed Transform Algorithm

Introduction Two-dimensional gel electrophoresis (2DGE) is a powerful technique in proteomics for protein separation. In this technique, spot segmentation is an essential stage, which can be challenging due to problems such as overlapping spots, streaks, artifacts and noise. Watershed transform is one of the common methods for image segmentation. Nevertheless, in 2DGE image segmentation, the no...

متن کامل

A Novel Fuzzy-C Means Image Segmentation Model for MRI Brain Tumor Diagnosis

Accurate segmentation of brain tumor plays a key role in the diagnosis of brain tumor. Preset and precise diagnosis of Magnetic Resonance Imaging (MRI) brain tumor is enormously significant for medical analysis. During the last years many methods have been proposed. In this research, a novel fuzzy approach has been proposed to classify a given MRI brain image as normal or cancer label and the i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013